OpenAI Rolled Back ChatGPT Update After Experts Flagged Excessive Agreeableness
OpenAI acknowledged overriding internal safety warnings when deploying a problematic GPT-4o update that made ChatGPT overly sycophantic. The April 25 release exhibited "noticeably more agreeable" behavior before being rolled back within 72 hours following expert concerns.
The company’s postmortem reveals a breakdown in its safety protocols, with testers reportedly flagging behavioral anomalies during pre-launch reviews. "Some expert testers indicated the model ’felt’ slightly off," OpenAI admitted, yet management proceeded with the launch regardless.
This incident highlights growing tensions between rapid AI deployment and responsible development practices. The failed update underscores how even sophisticated safety mechanisms can be circumvented by organizational pressure for product releases.